Opinion finding in blogs: a passage-based language modeling approach

Authors

  • Malik Muhammad Saad Missen
  • Mohand Boughanem
  • Guillaume Cabanac
Abstract

In this work, we propose a passage-based language modeling (LM) approach to opinion finding in blogs. We chose language modeling because of the importance of passages in blog posts and the performance LM has shown in various opinion detection approaches. In addition, we propose a novel method for bi-dimensional query expansion with relevant and opinionated terms, using Wikipedia and a relevance-feedback mechanism, respectively. We also compare the performance of three passage-based document ranking functions (Linear, Avg, Max). For evaluation, we use the TREC Blog06 data collection with the 50 topics of TREC 2006, measured against the best TREC-provided baseline (baseline4), which has an opinion-finding MAP of 0.3022. Our approach improves MAP by almost 9.29% over this baseline.
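The abstract names three passage-based document ranking functions (Linear, Avg, Max) without defining them. The sketch below shows one plausible reading, in which a document's score is derived from the scores of its passages. The function names, the `lam` interpolation weight, and in particular the interpretation of "Linear" as an interpolation between a whole-document score and the best passage score are assumptions for illustration, not the authors' definitions.

```python
from statistics import mean
from typing import Sequence


def _require_passages(passage_scores: Sequence[float]) -> None:
    # A document with no scored passages cannot be ranked by these functions.
    if not passage_scores:
        raise ValueError("at least one passage score is required")


def rank_max(passage_scores: Sequence[float]) -> float:
    """Max: the document is as good as its best-scoring passage."""
    _require_passages(passage_scores)
    return max(passage_scores)


def rank_avg(passage_scores: Sequence[float]) -> float:
    """Avg: the document score is the mean of its passage scores."""
    _require_passages(passage_scores)
    return mean(passage_scores)


def rank_linear(doc_score: float, passage_scores: Sequence[float],
                lam: float = 0.5) -> float:
    """Linear (assumed form): interpolate the whole-document score
    with the best passage score, weighted by the hypothetical `lam`."""
    _require_passages(passage_scores)
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * doc_score + (1.0 - lam) * max(passage_scores)


if __name__ == "__main__":
    scores = [0.2, 0.6, 0.4]  # made-up passage scores for one blog post
    print(rank_max(scores))
    print(rank_avg(scores))
    print(rank_linear(0.5, scores, lam=0.5))
```

Under this reading, Max rewards a post that contains a single strongly relevant, opinionated passage, while Avg penalizes posts whose remaining passages are off-topic.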

Similar resources

Fusion Approach to Finding opinions in Blogosphere

In this paper, we describe a fusion approach to finding opinion about a given target in blog postings. We tackled the opinion blog retrieval task by breaking it down to two sequential subtasks: ontopic retrieval followed by opinion classification. Our opinion retrieval approach was to first apply traditional IR methods to retrieve on-topic blogs, and then boost the ranks of opinionated blogs us...

WIDIT in TREC 2007 Blog Track: Combining Lexicon-Based Methods to Detect Opinionated Blogs

In TREC-2007, Indiana University's WIDIT Lab participated in the Blog track's opinion task and the polarity subtask. For the opinion task, whose goal is to "uncover the public sentiment towards a given entity/target", we focused on combining multiple sources of evidence to detect opinionated blog postings. Since detecting opinionated blogs on a given topic (i.e., entity/target) involves not o...

WIDIT in TREC 2006 Blog Track

The Web Information Discovery Integrated Tool (WIDIT) Laboratory at the Indiana University School of Library and Information Science participated in the Blog track's opinion task in TREC 2006. The goal of the opinion task is to "uncover the public sentiment towards a given entity/target", which involves not only retrieving topically relevant blogs but also identifying those that contain opinions about t...

Polarity Detection in Blog Comments from Blog Rss Feed by Modified TF - IDF Algorithm

Blogs are among the most common media on the web where users post their opinions. A blog is considered a web space where users share their views, beliefs, and other philosophy. Blogs posted across the web can be extracted from their RSS feeds. Once a blog is posted, several readers leave comments on it. Analyzing these comments can help in finding the opinion...

Second language Writing Through Blogs: An Investigation of Learner Autonomy

Employing an explanatory sequential design, the present study investigated the effect of English as a Foreign Language (EFL) blog-mediated writing instruction on students' learner autonomy. A total of 46 learners from two intact classes were randomly assigned to control and experimental groups. Over a 16-week semester, the control group students (n = 21) were taught ba...

Publication date: 2010